Identifying Unknown Unknowns in the Open World: Representations and Policies for Guided Exploration
نویسندگان
چکیده
Predictive models deployed in the world may assign incorrect labels to instances with high confidence. Such errors or unknown unknowns are rooted in model incompleteness, and typically arise because of the mismatch between training data and the cases seen in the open world. As the models are blind to such errors, input from an oracle is needed to identify these failures. In this paper, we formulate and address the problem of optimizing the discovery of unknown unknowns of any predictive model under a fixed budget, which limits the number of times an oracle can be queried for true labels. We propose a model-agnostic methodology which uses feedback from an oracle to both identify unknown unknowns and to intelligently guide the discovery. We employ a two-phase approach which first organizes the data into multiple partitions based on instance similarity, and then utilizes an exploreexploit strategy for discovering unknown unknowns across these partitions. We demonstrate the efficacy of our framework by varying the underlying causes of unknown unknowns across various applications. To the best of our knowledge, this paper presents the first algorithmic approach to the problem of discovering unknown unknowns of predictive models.
منابع مشابه
Comparison Between Open and Ultrasonography Guided Venous Access Ports in Children with Malignancy
Background: Long-term central venous access is used in children for various reasons specially for delivering chemotherapy. Since vessels in children have smaller diameters, they are more prone to injury and complications such as thrombosis. Different methods are used for implantation of port-a-cath in children. We aimed to compare the complications of insertion of central venous access ports be...
متن کاملDiscovering Unknown Unknowns of Predictive Models
Predictive models are widely used in domains ranging from judiciary and healthcare to autonomous driving. As we increasingly rely on these models for high-stakes decisions, identifying and characterizing their unexpected failures in the real world is critical. We categorize errors of a predictive model as: known unknowns and unknown unknowns [3]. Known unknowns are those data points for which t...
متن کاملSociological Analysis of the Rights of Children with Disabilities: Policies of Iran, the Islamic World, and the International Sphere
This study seeks to identify, analyze and strengths of policies related to children with disabilities in Iran, the Islamic world and the international level From Sociological Point of view that can finally provide interventions to improve the situation of children with disabilities in Iran. Research method is qualitative and research approaches are also exploratory and content analysis. After i...
متن کاملThe Exploration of Protective factors on prevention working children’s substance abuse
BackgroundChild labor is one of the challenges among most big cities in the world. In recent years, substance abuse among working, and street children has become a common phenomenon. Thus, in the present study, the protective factors affecting the prevention of substance abuse among Iranian working children were identified by using the social-ecological approach.Materials and MethodsThe partici...
متن کاملGender Concept “Woman” in the Minds of the Russian People (Taking the Chinese as Reference) According to an Associative Experiment
The article is devoted to the study of language representations of the concept of “woman” in the minds of the Russian and Chinese people based on a comparison of associative experiments of two languages, identifying the dynamics of the concept in the language consciousness of the people, establishing the specificity of the concept in the Russian language picture of the world referring to the Ch...
متن کامل